112. Structure from sequence: A view based on a global organization of the protein space
نویسندگان
چکیده
Introduction An essential step in any large-scale structural and functional genomic projects is to assign information to new un-annotated sequences. The flood of sequences accumulating from genome projects argues that a pressing need in structural and functional prediction efforts are automatic methods to bridge sequence and structural information. An unbiased set of structural representatives is used herein for determining the limitation of sequence-based information towards structural similarity [1]. It was shown that pairs of protein having >30-35% sequence identity (over long sequence), almost always share structural similarity. Below that ‘safe’ level the noise dominates, and for all protein pairs sharing at most 25% sequence identity only ~10% are indeed structural homologues. Below the 20% sequence identity, sequence based methods shown to perform poorly. Comparing advanced search programs for sequence-based structural predictions evaluated the limits of accurate identifications for proteins with low sequence significance scores [2]. Herein, we show that a hierarchical clustering of the entire protein space, based on sequence alone, manages to enrich the proportion of those pairs of proteins that are indeed structural homologues from within a set of protein pairs that share < 25% sequence identity.
منابع مشابه
رمزگشایی از فضامندی پدیده جهانی شدن با تکیه بر شاکله مفهومی تریالکتیک لوفور
Introduction This survey is based on deciphering the space of globalization phenomenon and revealing affected by spatial actions mechanism under investment discipline (according to compressed space and time). The basic question is how the space could be used, organized and created so that it could be in line with investment flow progress necessities. This can be unrevealed by applying concep...
متن کاملTowards a Complete Map of the Protein Space Based on a Unified Sequence and Structure Analysis of All Known Proteins
In search for global principles that may explain the organization of the space of all possible proteins, we study all known protein sequences and structures. In this paper we present a global map of the protein space based on our analysis. Our protein space contains all protein sequences in a non-redundant (NR) database, which includes all major sequence databases. Using the PSI-BLAST procedure...
متن کاملTowards a complete map of the protein space based on a uni ed sequence and structure analysis of all known proteins
In search for global principles that may explain the organization of the space of all possible proteins, we study all known protein sequences and structures. In this paper we present a global map of the protein space based on our analysis. Our protein space contains all protein sequences in a non-redundant (NR) database, which includes all major sequence databases. Using the PSI-BLAST procedure...
متن کاملRelation Between RNA Sequences, Structures, and Shapes via Variation Networks
Background: RNA plays key role in many aspects of biological processes and its tertiary structure is critical for its biological function. RNA secondary structure represents various significant portions of RNA tertiary structure. Since the biological function of RNA is concluded indirectly from its primary structure, it would be important to analyze the relations between the RNA sequences and t...
متن کاملتبیین الگوی نااقلیدسی در برنامه ریزی شهری
With domination of Kant's epistemology and instrumental reason in social science and human geography, interpretation of space have been based on neo physics that often it is equivalent with intuitive and physical experience and the place of capital and it's reproduction. Therefore we firstly have represented of ontological transform of space concept and by the way we enumerate the c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003